# Academic document processing

PP FormulaNet Plus M
Apache-2.0
PP-FormulaNet_plus-M is an enhanced formula recognition model developed by the PaddleOCR team. It supports Chinese formula recognition and handles complex formulas better.
Text Recognition, Supports Multiple Languages
PaddlePaddle
154
0
PP FormulaNet Plus L
Apache-2.0
PP-FormulaNet_plus-L is an enhanced formula recognition model developed by the PaddleOCR team. It supports Chinese formula recognition and raises the maximum number of tokens to 2560, which makes it suitable for complex formulas.
Text Recognition, Supports Multiple Languages
PaddlePaddle
954
0
Typress Ocr
MIT
A pre-trained TrOCR model for Typst formula OCR that converts images of mathematical formulas into Typst text.
Text Recognition, Transformers
paran3xus
88
2
Texteller
Apache-2.0
TexTeller is an end-to-end formula recognition model based on the ViT architecture. It recognizes mathematical formulas in natural images and converts them into LaTeX.
Text Recognition, Transformers
OleehyO
3,806
31
Texify
Texify is an OCR tool that converts images of formulas and text into LaTeX.
Text Recognition, Transformers
vikp
206.53k
15
Nougat Latex Base
Apache-2.0
A LaTeX OCR model fine-tuned from Nougat-base to generate LaTeX code from images, optimized for recognizing images of mathematical formulas.
Image-to-Text, Transformers, English
Norm
8,523
78
Nougat Base
Nougat is a model based on the Donut architecture, trained to transcribe scientific PDFs into Markdown.
Image-to-Text, Transformers
facebook
8,151
164